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Title: Newly identified cholinephosphotransferases and 
ethanolaminephosphotransferases 

The present invention relates to polypeptides, which transfer 
phosphocholine and phosphoethanolamine and the newly identification 
thereof, nucleotide sequences coding for any of these polypeptides and 
5 nucleotide sequences complementary thereto, plasmids, vectors and a 

(micro)organism or cell comprising said nucleotide sequences. Furthermore, 
the present invention relates to processes to produce 
cholinephosphotransferases and ethanolaminephosphotransferases like 
sphingomyelin synthase, ethanolamine phosphorylceramide synthase, 

10 phosphatidylcholineiglycoprotein cholinephosphotransferase and 

phosphatidylcholine:glycolipid cholinephosphotransferase. The present 
invention also provides the application of said nucleotide sequences to 
influence the equilibrium reactions or to develop compounds influencing the 
equilibrium reactions wherein said transferases are involved and the 

15 application of said compounds in medical use. Finally, a process has been 

provided to isolate candidates for functional genes of a previously unidentified 
enzyme from a huge database. 

Cholinephospho (Ch-P) transferases and 
ethanolaminephosphotransferases catalyze phosphosphingolipid synthesis in 

20 animals and parasites. Sphingomyelin (SM) is an abundant constituent of 

cellular membranes in a wide range of organisms, from mammals (Ullman and 
Radin, 1974) and nematodes (Satouchi et al., 1993) to protozoa like the human 
malaria parasite, Plasmodium falciparum (Elmendorf and Haldar 1994). SM is 
preferentially concentrated in the outer leaflet of the plasma membrane. Its 

25 high packing density and affinity for sterols help provide a rigid barrier to the 
extracellular environment and play a role in the formation of lipid rafts, 
specialized areas in cellular membranes with important functions in signal 
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transduction and membrane trafficking (Simons and Toomre, 2000; Holthuis 
et aL, 2001). Since the discovery of the 'SM cycle 5 as a putative signaling 
system analogous to well-known second messenger systems lik e the 
phosphoinositide pathway, SM has emerged to the focus of interest in many 
5 research laboratories (Kolesnick and Hannun, 1999; Andrieu-Abadie and 
Levade, 2002). 

Since the enzyme SM synthase is also able to catalyse the reverse 
reaction, namely the formation of PC from SM and DAG (Marggraf and 
Kanfer, 1984, van Helvoort et aL, 1994) it may regulate, in opposite directions, 

10 the cellular levels of the bioactive lipids ceramide and DAG. The latter two are 
important regulators of membrane trafficking, cell proliferation and apoptosis 
(Kolesnic and Hannun, 1999; Bankaitis, 2002; Brose and Rosenmund, 2002, 
Pettus et aL, 2002). Hence, one may anticipate that the physiological 
significance of SM synthase goes beyond the formation of SM. 

15 The subcellular localisation of the enzyme has been the subject of 

numerous studies. After the initial debate had focused on whether SM 
synthase is located in the Golgi or at the plasma membrane (Marggraf et al.; 
Voelker and Kennedy, 1982; Lipsky and Pagano, 1985), subsequent work revealed 
evidence for the presence of SM synthase activity in both membranes (Futeiman 

20 et al., 1990; Jeckel et al., 1990; van Helvoort et al., 1994). Whether SM synthesis 
detected at these locations is due to the presence of more than one isoform of 
the enzyme has remained an open issue. 

Progress in understanding the biological roles of SM synthesis and its 
regulation is hampered by the fact that no successful purification of the 

25 responsible enzyme has been achieved. Recent work has identified a bacterial 
SM synthase released from Pseudomonas aeruginosa (Luberto et al., 2003). 
However, this activity is a soluble protein, hence in contrast to the animal 
enzyme that is tightly associated with the membrane (Ullman and Radin, 1974; 
Voelker and Kennedy, 1982). Other efforts focused on the isolation of SM synthase 

30 mutants by screening Chinese hamster ovary cells for resistance to a SM- 
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directed cytolysin (Hanada et al., 1998). Instead of mutants with a primary defect 
in SM synthase, this method yielded mutants defective in serine 
palmitoyltransferase activity or blocked in ceramide transport to the site of SM 
synthesis (Fukasawa et al., 1999). 
5 In the light of the prior art it is therefore highly desirable to identify and 

purify SM synthases, the composing amino acid sequence and microorganisms 
producing the same based on a process comprising the expression of nucleotide 
sequences coding for the amino acid sequences. 

10 Definitions 

Throughout this application the following definitions will be used: 

- homology: sharing an ancestral molecule 

- identity: extent to which amino acid sequences are invariant, in other words 
15 unchanged 

- "conservative" or "non-conservative" mutation: replacement of one amino acid 
by another, which does not change the enzyme function upon alignment (lining 
up sequences to achieve maximum levels of identity and conservation) 

- similarity: percentage of identical amino acids or amino acids with similar 
20 properties between two sequences. 

Abbreviations 

Throughout this application the following abbreviations will be used: 

25 

Human (Hu) 

Caenorhabditis elegans or C. elegans (Ce) 
Drosophila melanogaster (Dm) 
Saccharomyces cerevisiae (Sc), 
30 Schizosaccharomyces pombe (Sp) 
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Candida albicans (Ca) 
sphingomyelin (SM) 
sphingomyelinase (SMase) 
phosphatidyl choline (PC) 
diacyl glycerol (DAG) 
ceramide (CER) 



N-Acyl-D-sphingosine-l-phosphocholine 



l,2-Diacyl-sn-glycero-3-phosphocholine 
sn-l,2-Diacyl glycerol 
N-Acyl-D-sphingosine 



10 



15 



20 



inositol phosphorylceramide (IPC) 
lipid phosphate phosphatase (LPP) 

N-(7-(4-nitrobenxo-2-oxa-l,3-diazole)aminocaproylsphingosine (NBD-CER) 
2V-ethylmaleimide (NEM) 

electrospray ionization tandem mass spectrometry (ESI-MS/MS) 

phytoceramide-based SMs (phytoSM) 

phosphorylcholine (Ch-P) 

cytidine-5 1 diphosphocholine -choline (CDP-Ch) 

tubovesicular membranes (TVM) 

phosphatidylethanolamine (PE) 

phosphatidylinositol (PI) 

phosphatidylserine (PS) 

phosphatide acid (PA) 

phosphatidylglycerol (PG) 

open reading frame (ORF) 

CSS1, CSS2 and CSS3 (for candidate SM synthase family 1-3), 

with human CSS3al and CSS3oc2 renamed as /mSMSl and 
/iuSMS2 and C. elegans CSS3ocl2 and C. elegans CSS3a2 renamed 
as ceSMSl and ceSMS2 
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-- with human CSS3p renamed as huSMSx (SMS related), with 
mice CSS3P renamed as wioSMSl, with C. elegans CSS3p as 
ceSMSl and Drosophila CSS3B, as doSMSl 
-- with C. elegans CSS3y renamed as ceSMSdr, (SMS distantly 
5 related). 

Description of the figures with legends. 
Fig. 1. Selection and phylogenetic analysis of candidate SM synthases. 

10 (A) 

Animal entries in SwissProt/TrEMBL were searched for the presence of a 
sequence motif shared by LPPs and Aurlp proteins and then further selected 
on the basis of three additional criteria, as indicated. (B) Phylogenetic tree of 
human candidate SM synthases (CSS) and previously characterised members 

15 of the human LPP super family. (C) Phylogenetic tree of CSS proteins from 
human (Hu), mouse (Mm), Caenorhabditis elegans (Ce) and Drosophila 
melanogaster (Dm), and of Aurlp proteins from Saccharomyces cerevisiae (Sc), 
Schizosaccharomyces pombe (Sp) and Candida albicans (Co). Asterisks denote 
CSS proteins expressed in S. cerevisiae and tested for SM synthase activity. 

20 SM synthases (SMS) are marked in red. SwissProt/TrEMBL accession 

numbers of CSS proteins are: (1) Q9TYV2; (2) Q20696; (3) Q9VS60/Q9VS61; 
(4) Q96LT4; (5) Q9DA37; (6) Q86VZ5; (7) Q8VCQ6; (8) Q8NHU3; (9) Q9D4B1; 
(10) Q9XTV2; (11) Q20735; (12) Q965Q4; (13) Q9D606; (14) Q9NXE2; (15) 
Q8VCY8; (16) Q96GM1; (17) Q96MP0; (18) AAP57768; (19) Q9BQF9; (20) 

25 AAP57767; (21) Q22250; (22) Q9TXU1; (23) Q9VNT9; (24) Q9VNU1; (25) 

Q10022; (26) Q9D4F2; (27) Q8IY26; (28) Q91WB2; (29) Q96SS7; (30) Q8T8T9; 
(31) Q22461. Note that (4) Q96LT4 contains a partial protein sequence and 
that the complete ORF was deduced from a corresponding EST clone (see the 
Table). 

30 
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Fig. 2. A subset of CSS3 family members displays SM synthase activity 
upon expression in yeast. (A) Immunoblots of cells expressing various CSS 
proteins were stained with antibodies recognising the V5 epitope-tagged 
carboxy termini of CSS proteins. Control denotes cells transformed with empty 
5 vector. (B) TLC separation of reaction products generated when NBD-ceramide 
(NBD-Cer) was incubated with lysates of control or CSS-expressing cells in the 
presence (+) or absence (-) of IPC synthase inhibitor, aureobasidin A (Aba). (C) 
Metabolic labelling of cells expressing human CSS3al/SMSl or CSS3a2/SMS2 
with p 4 C]-choline and NBD-ceramide. The lipids were extracted, separated by 

10 two-dimensional TLC and analysed for fluorescence and radioactivity. Note 
that SMS1- and SMS2-expressing cells, but not control cells, synthesized NBD- 
SM, and that this NBD-SM was labelled with [ 14 C]-choline. (D) Metabolic 
labelling of cells expressing human CSS3a2/SMS2 with [ 14 C] -choline. The 
lipids were extracted, deacylated by mild alkaline hydrolysis (+NaOH) or 

15 control incubated (-NaOH) and separated by two-dimensional TLC before 
autoradiography. Note that SMS2-expressing cells synthesized alkaline- 
resistant species of [ 14 C]-choline-labelled lipids (phytoSM) that were absent in 
control cells. 

20 Fig. 3. NEM inhibits yeast-associated SMase but not mammalian SM 

synthase. (A) TLC separation of reaction products generated when NBD-SM 
(1.8 uM) was incubated with lysates of exponentially grown yeast cells (20 
ODeoo cells) in the presence (+) or absence (-) of iV-ethylmaleimide (NEM; 1 
mM). Reactions were in 0.35 ml RBI buffer containing 1 ug/ml of IPC 

25 synthase inhibitor (aureobasidin A) and incubated for 30 min at 37 * C. (B) TLC 
separation of reaction products generated when NBD-ceramide (NBD-Cer; 1,8 
mM) was incubated with lysates of HeLa cells (1/1: 2 x 10 e cells; 1/4: 0.5 x 10* 
cells) in the presence (+) or absence (-) of NEM (1 mM). Reactions were in 0.35 
ml PNS buffer, and incubated for 30 min at 37°C. Lipid extraction and TLC 

30 analysis were as described in Materials and methods. 
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Fig. 4. Detergent extracts of human CSS3a2/SMS2-expressing yeast 
cells support SM formation from bovine brain ceramides. Lines A and B show 
the reconstituted ion chromatograms of m/z 731.6, corresponding to the m/z 
ratio of protonated 18:0 SM, during the separation of molecular species of 
choline-containing phospholipids after solid phase extraction of membrane 
extracts that had been incubated with bovine brain ceramides and egg PC. The 
elution time of an authentic standard of 18:0 SM is indicated in the 
chromatogram by an arrow (42.3 minutes). Line A was derived from detergent 
extracts of human CSS3a2/SMS2-expressing cells and line B from detergent 
extracts of control cells. Mass spectra recorded at the elution time of 18:0 SM 
confirmed that formation of 18:0 SM occurred in CSS3a2/SMS2-containing 
extracts (top right panel), but not in control extracts (bottom right panel). 

Fig. 5. SM synthase activity in detergent extracts of human SMS1- or 
SMS2-expressing yeast cells is stimulated by externally added PC. Detergent 
extracts of yeast cells expressing human SMS1 or SMS2 were diluted 2- to 8- 
fold in extraction buffer, and then incubated with NBD-ceramide (1.8 \iM) in 
the presence (+) or absence (-) of externally added PC (22 (aM), as described in 
Materials and methods. Formation of NBD-SM was monitored by one- 
dimensional TLC. 

Fig. 6. Human SMSl and SMS2 function as PC:ceramide choline 
phosphotransferases. (A) Detergent extracts of yeast cells expressing human 
SMSl, SMS2 or transformed with empty vector (control) were incubated with 
NBD-ceramide (1.8 \xM) in the presence or absence of different potential head 
group donors (22 |iM), as indicated. Formation of NBD-SM or NBD-IPC was 
monitored by one-dimensional TLC and quantified as described in Materials 
and methods. Note that addition of PI stimulated formation of NBD-IPC in all 
three extracts (asterisks). PC, phosphatidylcholine; SM, sphingomyelin; Ch, 
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choline; Ch-P, phosphorylcholine; CDP-Ch, cytidine-5' diphosphocholine; PE, 
phosphatidylethanolamine; PI, phosphatidylinositol; PS, phosphatidylserine; 
PA, phosphatide acid; PG, phosphatidylglycerol. (B) Detergent extracts of 
SMS2-expressing and control cells were incubated with NBD-ceramide and 
5 [ 3 H]-chohne labelled PC. The lipids were extracted, separated by two- 
dimensional TLC and analysed for fluorescence and radioactivity. Note that 
only SMS2-expressing cells synthesized NBD-SM, and that this NBD-SM was 
labelled with [ 3 H]-choline. 

10 Fig. 7. Human SMS1 and SMS2 exhibit reverse activity. Detergent 

extracts of yeast cells expressing human SMS1, SMS2 or transformed with 
empty vector (control) were incubated with NBD-diacylglycerol (NBD-DAG; 1.8 
uM) in the presence or absence of different head group donors (22 |xM), as 
indicated. Formation of NBD-PC was monitored by one-dimensional TLC and 

15 quantified as described in Materials and methods. 

Fig. 8. Conserved sequence motifs in SMSl, SMS2 and SMSr proteins. 
(A) Alignment of human SMSl, SMS2 and SMSr amino acid sequences. 
Identical residues are highlighted in black and conservative amino acid 

20 substitutions in grey. Conserved residues within four homology motifs, 
designated Dl to D4, are highlighted in blue and conservative amino acid 
substitutions in the D2 and D4 motifs of SMSr proteins in green. Note that 
motifs D3 and D4 display similarity to the C2 and C3 domains in LPPs, with 
identical residues highlighted in red. Regions predicted to form 

25 transmembrane domains, TM1 to TM6, are marked by a black line. (B) 
Alignment of the four homology motifs in SMS and SMSr proteins from 
humans (Hu), C. elegans (Ce), P. falciparum (Pf) and D. melanogaster (Dm). 
PlasmoDB accession no. of P/SMSl and .P/SMS2 are MAL6P 1.177 and 
MAL6P1.178, respectively. Putative active site residues in consensus 
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sequences are underlined. Symbols used are n, small neutral amino acid; O, 
aromatic amino acid; b, branched amino acid; x, any amino acid. 

Fig. 9. Sequence listings 1 - 11: conserved sequence motifs of 
5 phosphotransferases as indicated in figure 8 B 

Sequence listings 12-22: polypeptide sequences of cholinephosphotransferases and 
ethanolaminephosphotransferases 



10 


12. 


human sphingomyelin synthase 1 




13. 


human sphingomyelin synthase 2 




14. 


human sphingomyelin synthase related 




15. 


Caenorhabditis elegans sphingomyelin synthase 1 




16. 


Caenorhabditis elegans sphingomyelin synthase 2 


15 


17. 


Caenorhabditis elegans sphingomyelin synthase 3 




18. 


Caenorhabditis elegans sphingomyelin synthase distantly related 




19. 


Caenorhabditis elegans sphingomyelin synthase related 




20. 


Plasmodium falciparum sphingomyelin synthase 1 




21. 


Plasmodium falciparum sphingomyelin synthase 2 


20 


22. 


Drosophila melanogaster sphingomyelin synthase related. 



Fig. 10. Human SMS1 and SMS2 are encoded by ubiquitously expressed 
genes. Northern blot analysis of SMS 1 and SMS2 transcripts (arrows) in 
various human tissues. Random prime-labelled human SMS1 or SMS2 cDNA 
25 was hybridised to a human poly(A) + RNA blot (Origene, Rockville, MD). As a 
control for loading, the RNA blot was stripped and rehybridized with a human 
fi-actin cDNA probe. Mobilities of RNA size markers are indicated. 



30 



Fig. 11. Human SMS1 and SMS2 localise to different cellular 
organelles. (A) 
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HeLa cells co-transfected with V5-tagged SMS1 and m;yc-tagged 
sialyltransferase were incubated in the presence or absence of 10 jjJM 
nocodazole for 60 min at 37°C, fixed, and then co-stained with rabbit anti-V5 
and mouse antwrtyc antibodies. Counterstaining was with FITC-conjugated 
5 goat anti-rabbit and Texas red-conjugated goat anti-mouse antibodies. (B) 
HeLa cells transfected with V5-tagged SMS2 were biotinylated on ice, fixed 
and then co-stained with mouse anti-V5 and rabbit anti-biotin antibodies. 
Counterstaining was with Texas red-conjugated goat-anti mouse and FITC- 
conjugated goat anti-rabbit antibodies. Bar, 10 jam. 

10 

Fig.12. Human SMS2 is partially associated with the Golgi. HeLa cells 
co-transfected with V5-tagged SMS2 and wyc-tagged sialyltransferase were 
incubated in the presence or absence of 10 \jM nocodazole for 60 min at 37°C, 
fixed, and then co-stained with rabbit anti-V5 and mouse anti-myc antibodies. 
15 Counterstaining was with FITC-conjugated goat anti-rabbit and Texas red- 
conjugated goat anti-mouse antibodies. Bar, 10 |im. 

Fig. 13. Membrane topology of human SMS1 and SMS2 proteins. (A) 
Schematic 

20 view of the predicted membrane topologies of V5-tagged SMS1 and SMS2, and 
the Golgi-associated type I membrane protein, p24. (B) Immunoblot of intact or 
lysed HeLa cells expressing V5-tagged SMS1 or SMS2 and pretreated with 8 
mM trypsin for 30 min at 30 9 C in the presence or absence of 0.4% Triton X100, 
as indicated. Immunoblots were stained with antibodies against the V5 epitope 

25 (a-V5) or against p24 (a -p24). 

Fig. 14. SMS1 and SMS2 are required for cell growth and survival. 
(A) SMS1, SMS2 and SMSr are expressed in human HeLa cells. Open reading 
frames of human SMS1, SMS2 and SMSr were amplified by reverse- 
30 transcriptase PCR on 5 jag total RNA isolated from HeLa cells, according to 
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instructions of the manufacturer (Titan One Tube RT-PCR, cat. No. 1855476, 
Roche Diagnostics GmbH, Mannhein, Germany). Products were separated on a 
1% agarose gel and stained with ethidium bromide. (B) HeLa cells stably 
transfected with V5-tagged SMS1 (SMSl-V5/pcDNA3.1) were treated with 
5 lamin A, SMS1 or SMS2 siRNA duplexes for 72 hr before Western blot analysis 
using anti-V5 and anti-lamin A antibodies, as described (Elbashir et al., 
Nature 411, 494-498). Lamin A, SMS1 and SMS2 were targeted with siRNA 
duplexes designed from the sequences 5'-aactggacttccagaagaacatc-3\ 5'- 
aactacactcccagtacctgg-3', and 5 , -aacccaagagcttatccagtg-3 > , respectively. (C) 

10 Hela cells stably transfected with V5-tagged SMS1 were treated with lamin A 
or SMS1 siRNA for 72 hr, and then fixed, permeabilized and co-stained with 
mouse-anti GM130 and rabbit anti-V5 antibodies. Counterstaining was with 
Texas red-conjugated goat-anti-mouse and FITC-conjugated goat-anti-rabbit 
antibodies, (D) Growth curves of Hela cells stably transfected with V5-tagged 

15 SMS1 and treated with lamin A, SMS1 and/or SMS2 siRNA duplexes in 
serum-free Optimem I medium supplemented with Glutamax I 
(GIBCO/Invitrogen, Carlsbad, CA). Cell numbers were assessed at 0, 24, 48 
and 72 hr after treatment with siRNA duplexes. 

20 

Table: EST clones and primers used in this study 
Protein Accession EST clone ID Primer pairs N- and C-terminal 
name number 1 



ffuCSSlal Q9NXE2 IRAKp961F14 AAGCCATGGCTGTAGGAAACAACAC 

32 2 GGTAACTTCGGTCATGGAC 
JTuCSSla2 Q96GM1 IRALp962F05 AAGCCATGGCGGGAGGGAGA 
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GGTGGCCACGGCGGGC 




AOCGC7 


lrCALp 962104 
172 


AAGCCATGCCAGCTTCCCAGAG 
CCAGGCAGAGATGAGCATC 


11 UVjOOOU ± 


v^OD V ZjO 


T fTWm QQ1 3 

juLxiNuioyi 0 


AAGCCATGATCCTTGTAGGACTCTG 


(SMS1) 






TGTGTCATTCACGAGCCGG 


HwCSS3a2 


Q8NHU3 


IRAKp961F14 
33 2 


AAGCCATGAATATGTTTGATGCTGACA 


(SMS2) 




ATCCTTATAAGCCCGTGTGG 


i?uCSS3p 
MmCSSla2 


Q96LT4 
Q8VCY8 


IMAGp958I13 
1239Q2 

IRAKt>961L15 

a. a. vi. *.x juj^y v v J. X-J X 

31 2 


AAGCCATGGCAGGTCCTAATCAACTC 
TCCAATTAGTCTTTTCATTATTGC 

rxrxvj \-/vjxx l VJVJu X VJVJJrl\JrVjlxftAJT 

GGTOftPPAPfu^pprtfip 


MmCSS2a 


091WB2 


IMAGn998R0 
810812 2 


A AnrP ATP.PP AnPTVTPPPA P A 

CCAGGCAGAGATGAGCATC 


MwCSS3al 


Q8VCQ6 


IRAKp961L21 


AAGCC ATO A A frfr A A (xTOOTTT A P 

fulUVvA X vJxl_Ta. V_J VJTxxxx VjT 1 VjTvJT X 1 X 






412 


TGTGTCGTTTACCAGCCGG 


CeCSSly 


Q22250 


YK289ff7 
/YK572h5 4 


AAGPPATOTPPfrTnPlP A fVPTTPn 
GTACCGATATPPATPATTTTP 


CeCSS3al 


Q9XTV2 


YK559h9 / 


AAGPPATO A AAA TflTPTTPP A A TP A fP A 


(SMS1) 




YK517el0 4 


A TTTTGGCAGAGACATGGTAG 


CeObS3a2 


Q20735 


YK428e6 / 


AAGCCATGACAAACAGTTCGGAGTTC 






YK109h8 4 


TTGCAATTTGTAGTTGATACGA 


CeCSS3B 


Q20696 


YK524b8 4 


AAGCCATGCTGGATAACAGACCTATAC 
ATTGTGCTTTTTGGTATGATTTTTG 
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1 Swiss-Prot/TrEMBL 

2 Obtained from RZPD Deutsches Ressourcenzentrum fur Genomforschung, Berlin, 
Germany 

5 3 Provided by Dr. Sumio Sugano, Institute of Medical Science, University of Tokyo, 

Japan 

4 Provided by Dr. Yuji Kohara, National Institute of Genetics, Mishima, Japan 

To achieve the targets of the invention a complementary approach was 
10 pursued for the identification of animal SM synthase that takes advantage of 

structural information available for enzymes catalyzing analogous reactions. 

In contrast to most animal cells, plants, fungi and yeast do not produce SM. 

Instead, these organisms add phosphoinositol to phytoceramide to generate 

inositol phosphorylceramide, or IPC (Dickson, 1998). IPC production in the 
15 yeast Saccharomyces cerevisiae requires the product of the AUR1 gene (Nagiec 

et aL, 1997). 

Sequence analysis of Aurlp proteins from different fungi revealed four 
conserved motifs (Heidler and Radding, 2000) two of which are similar to the 
C2 and C3 domains present in members of the lipid phosphate phosphatase 

20 (LPP) family (Waggoner et al, 1999). 

LPPs play a critical role in cell signaling by controlling the conversion of 
bioactive lipid phosphate esters such as (lyso)phosphatidic acid and 
sphingosine-l-phosphate to their dephosphorylated counterparts. The 
conserved CI, C2 and C3 domains in LPPs likely constitute the active site for 

25 cleavage of the bond between the lipid hydroxyl and phosphate groups 
(Neuwald, 1997). 

In the case of Aurlp, this reaction would represent the first step in the 
transfer of inositol phosphate from PI, with the resulting enzyme -phosphate 
intermediate being subjected to nucleophilic attack by the oxygen of ceramide 
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rather than the oxygen of water used by LPPs. The presence of LPP-like 
motifs, together with the IPC synthase activity found associated with affinity- 
purified Aurlp, suggests that Aurlp is directly responsible for IPC synthesis. 
Insects and mammals contain an ethanolamine phosphorylceramide 
5 (EPC) synthase, another phosphotransferase, that may be structurally related 
to SM synthase. 

The insect Drosophila melanogaster (Dm) does not generate SM, but instead 
produces phosphoethanolamine phosphorylceramide (EPC (Rietveld et al., (17). 
Moreover, it has been shown that the endoplasmic reticulum of mammalian 

10 cells harbours an EPC synthase (Malgat et al., 1987). Furthermore, the 
nematode Caenorhabditis elegans (Ce) generates phosphorylcholine- 
substituted glycolipids (Lochnit, G., et al., 2000) still another 
phosphotransferase, that may be structurally related to SM synthase. 
Based on the above considerations a bioinformatics and functional 

15 cloning strategy was developed to identify the enzyme responsible for SM 
synthesis in animals and to identify other cholinephosphotransferases. The 
present invention provides a surprisingly successful strategy which comprises 
a process to isolate candidates for functional genes of a previously unidentified 
enzyme with known activity from a huge database by combining at least four 

20 characteristics based on data from bio -informatics and from biochemistry, viz. 

- presence of a sequence motif shared with previously identified enzymes 
having a related function 

- biochemical function of the gene should be unknown until now 

- no structural homologues in an organism that does not contain the enzyme 
25 - ability to mediate a reaction catalysed by the unidentified enzyme upon its 

heterologous expression in an organism or cell lacking said enzyme activity. 
Preferably also the presence or non-presence of transmembrane domains 
depending on the working mechanism of the enzyme in relation to the 
membrane is considered. 

30 
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By searching the human, mouse, C. elegans and Drosophila protein 
databases using a sequence motif shared by LPPs and Aurlp homologues and 
further selection on the basis of the three additional criteria mentioned above 
and, a set of cholinephospho transferase sequences was collected as depicted in 
5 Fig. 8B. Conserved residues within a lot of homology motifs have been found. 

Therefore, the present invention further provides an isolated 
polypeptide comprising one or more of the amino acid motifs selected from the 
group consisting of a sequence of at least 80%, preferably 90 % identity to any 
of 

0 

(a)P-L-X-D-X(35,75)-R^X(8)-[YF]-X(2>R-X(6>T 



(b) C-X-D-X(3)-S-G-H-T 
15 (c) H-Y-[TS]-X-D-[V^^ 

The wording isolated means that said polypeptide has been identified in a database 
comprising a large amount of polypeptides. 

In particular said polypeptide is derived from the group consisting of 
20 Animalia, Alveolata and Kinetoplastida. Preferably, said polypeptide is selected 
from the group comprising any of the SEQ ID No. 1 - 11 or at least 70%, more 
preferably at least 80%, still more preferably at least 90% similarity to the 
conserved amino acid motifs listed therein. 

Furthermore the present invention provides a polypeptide comprising an 
25 amino acid sequence with at least 70%, preferably at least 80% and still more 
preferably at least 90% similarity to any of the SEQ ID No. 12 - 22. Each of 
these sequences comprise amino acid motifs of the SEQ ID No. 1 - 11 or 
sequences related thereto so that the similarity grade of at least 70 % to any of 
the SEQ ID No. 12 - 22 will be reached. 
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The present invention also provides a polypeptide comprising an amino 
acid sequence with at least 20 % identity to any of the SEQ ID No. 12 - 22. In 
particular, a polypeptide comprising an amino acid sequence with at least 30 % 
identity to SEQ ID No. 12 and therefore at least also including SEQ ID No. 13. 
5 Furthermore, a polypeptide is provided by the present invention 

comprising an amino acid sequence with at least 40 % identity to any of the 
SEQ ID No. 19 - 21. Finally, the invention provides a polypeptide comprising 
an amino acid sequence with at least 22 % identity to SEQ ID No. 22. 
According to a preferred embodiment of the present invention a 

10 polypeptide is provided with one or more of the activities selected from the 

group consisting of cholinephosphotransferases and ethanolaminephosphotransferases, 
preferably one or more of the activities selected from the group consisting of 
sphingomyelin synthase, ethanolamine phosphorylceramide synthase, 
phosphatidylcholine: glycoprotein cholinephosphotransferase and 

15 phosphatidylcholine iglycolipid cholinephosphotransferase. Furthermore, a 
nucleotide sequence selected from the group consisting of a nucleotide 
sequence coding for any of the amino acid sequences of a polypeptide with one 
or more of said activities and an anti sense nucleotide sequence that is 
complementary thereto is provided, together with a plasmid, a vector and an 

20 organism or micro organism and a cell line comprising said nucleotide 
sequences. 

According to a further preferred embodiment of the present invention a 
process is provided for producing sphingomyelin synthase or sphingomyelin 
comprising the expression of the nucleotide sequences coding for a polypeptide 
25 with sphingomyelin synthase activity in a (micro)organism or cell line 
comprising the nucleotide sequences coding for such a polypeptide. 
Furthermore, the use of one of more of the nucleotide sequences coding for a 
polypeptide with sphingomyelin synthase activity to influence the reaction 
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in vivo or in vitro has been provided. In this way a powerful means has been 
created to enhance the conversion of CER and PC into SM and DAG and vice 
versa. Besides this, the present invention also provides the use of one of more 
5 of the nucleotide sequences coding for a polypeptide with sphingomyelin 
synthase activity to identify or 
develop compounds influencing the reaction 



CER + PC<->SM + DAG. 

10 

According to still a further preferred embodiment of the present 
invention a process is provided for producing ethanolamine 
phosphorylceramide synthase or ethanolamine phosphorylceramide 
comprising the expression of the nucleotide sequences coding for a polypeptide 
15 with ethanolamine phosphorylceramide synthase activity in a (microorganism 
or cell line comprising the nucleotide sequences coding for such a polypeptide. 
Furthermore, the use of one of more of the nucleotide sequences coding for a 
polypeptide with ethanolamine phosphorylceramide activity to influence the 
reaction 

20 

CER + PE^-EPC + DAG 



in vivo or in vitro has been provided. In this way a powerful means has been 
created to enhance the conversion of CER and PE into EPC and DAG and vice 
25 versa. Besides this, the present invention also provides the use of one of more 
of the nucleotide sequences coding for a polypeptide with ethanolamine 
phosphorylceramide synthase activity to identify or develop compounds 
influencing the reaction 
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According to another preferred embodiment of the invention the 
application of the compounds mentioned above which can be identified or 
developed by the use of one of more of the nucleotide sequences coding for a 
5 polypeptide with sphingomyelin synthase and/or ethanolamine 
phosphorylceramide synthase in medical use, especially 
for the manufacture of medicaments treating a disease selected from the group 
consisting of cancer, metabolic diseases, for instance the Niemann Pick A 
disease and diseases caused by parasites, for instance malaria, is provided. 

10 According to a further preferred embodiment of the present invention a 

process is provided for producing phosphatidylrglycoprotein 
cholinephosphotransferase or phosphatidyl:glycolipid 
cholinephosphotransferase, phosphorylcholine-substituted glycoprotein or 
phosphorylcholine-substituted glycolipid comprising the expression of the 

15 nucleotide sequences coding for a polypeptide with phosphatidyl: glycoprotein 
cholinephosphotransferase or phosphatidyl:glycolipid 

cholinephosphotransferase activity in a (micro)organism or cell line comprising 
the nucleotide sequences coding for the corresponding polypeptide. 
"Corresponding" stands for the nucleotide sequences which code for the 

20 enzyme with the corresponding activity, so either for the enzyme with the 
phosphatidyl: glycoprotein chohnephosphotransferase activity or for the 
enzyme with the phosphatidyl:glycolipid chohnephosphotransferase activity. 
Furthermore, the use of one of more of the nucleotide sequences coding for a 
polypeptide with phosphatidyl-glycoprotein chohnephosphotransferase or 

25 phosphatidyl:glycolipid cholinephosphotransferase activity to influence the 
reaction 



glycohpid/protein + PC PC-substituted glycolipid/protein + DAG 
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in vivo or in vitro has been provided. In this way a powerful means has been 
created to enhance the conversion of glycolipid/protein and PC into PC- 
substituted glycolipid/protein and DAG and vice versa. 

Besides this, the present invention also provides the use of one of more 
5 of the nucleotide sequences coding for a polypeptide with 
phosphatidyhglycoprotein 

cholinephosphotransferase activity or phosphatidyl: glycolipid 
cholinephosphotransferase activity to identify or develop compounds 
influencing the reaction 

10 

glycolipid/protein + PC <-» PC-substituted glycolipid/protein 4- DAG. 

According to another preferred embodiment of the invention the 
application of the compounds mentioned above which can be identified or 

15 developed by the use of one of more of the nucleotide sequences coding for a 
polypeptide with phosphatidyl: glycoprotein cholinephosphotransferase or 
phosphatidyl: glycolipid cholinephosphotransferase in medical use, especially 
for the manufacture of a medicament treating a disease caused by parasitic 
nematodes, is provided. 

20 A detailed description of the various aspects of the present invention 

will be given here below. 

A novel family of integral membrane proteins likely responsible for SM 
synthesis in animal cells was identified according to the process of the present 
invention. The two human members of this family, SMSl and SMS2, were 

25 subjected to a detailed biochemical analysis and it is believed that their 

functional assignment as SM synthases is justified on the following grounds. 

First, heterologous expression of SMSl or SMS2 proves sufficient to 
support SM synthesis in the yeast S. cerevisiae, an organism lacking 
endogenous SM synthase activity (figures 2, 4 and 6). Second, in keeping with 

30 the enzymatic properties of mammalian SM synthase reported in the 



WO 2005/003336 



PCT/NL2004/000488 



20 

literature, SMS1 and SMS2 function as bi-directional lipid 
cholinephosphotransferases capable of converting PC and ceramide to SM and 
DAG and vice versa (figures 6 and 7). Third, SMS1 and SMS2 localize to 
cellular organelles previously established as the principal sites of SM 
5 synthesis, namely the Golgi (SMS1) and the plasma membrane (SMS2), figure 
11. Fourth, SMS1 and SMS2 share sequence motifs containing putative active 
site residues with lipid phosphate phosphatases and candidate PIxeramide 
inositolphosphotransferases (figure 8). Finally, SMS1 and SMS2 adopt a 
membrane topology whereby the putative active site residues are facing the 

10 exoplasmic leaflet (figure 13), hence the side of the membrane where SM 
synthesis was found to occur. 

A multiplicity of SM synthase genes appears widely spread among 
organisms generating SM. Human SMS1 and SMS2 are highly conserved in 
mammal s and evidence was found for the existence of up to three functional 

15 homologues in the nematode C. elegans. Moreover, two orthologous sequences 
in the human malaria parasite P. falciparum were identified according to the 
process of the present invention, suggesting that this lower eukaryote too 
contains different SM synthases encoded by separate genes. SM synthesis in P. 
falciparum occurs in the Golgi apparatus as well as in the network of 

20 tubovesicular membranes (TVM) emerging from the parasitophorous vacuolar 
membrane during intraerythrocytic development (Elmendorf and Haldar, 
1994). A differential sensitivity to l-phenyl-2-acylamino-3-morpholino-l- 
propanol has been taken as evidence that the Golgi- and TVM-associated 
activities correspond to different enzymes (Kauer et al., 1995). Our current 

25 findings provide a novel opportunity to test this prediction and may facilitate 
the development of new drugs against parasites as for instance malaria. 

The uniform tissue distributions of transcripts for SMS1 and SMS2 in 
humans (figure 10) suggest that most mammalian cell types would contain 
both SM synthase isoforms. Biochemical analysis of human SMS1 and SMS2 

30 thus far revealed no fundamental differences in enzymatic properties, even 
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though the possibility that substrate preferences are lost when enzymes are 
assayed in a heterologous, detergent-containing system cannot be excluded. 
For now, the most striking difference between SMS1 and SMS2 concerns their 
subcellular localisation. While SMSl seems to represent the well-known Golgi- 
5 associated SM synthase, SMS2 primarily resides at the plasma membrane. 
Hence, SMSl would be proximal to SMS2 with respect to receiving newly 
synthesized ceramide from the ER. A challenging prospect is that, whereas the 
Golgi enzyme would be responsible for generating the bulk of cellular SM, the 
second enzyme may serve a principal role in signal transduction at the plasma 

10 membrane. The activation of a sphingomyelinase (SMase) and subsequent 
liberation of ceramide at the plasma membrane has been recognised as an 
important signalling event in the regulation of fundamental cellular processes 
that include cell cycle arrest, differentiation and apoptosis (Kolesnic and 
Hannun, 1999; Pettus et al, 2002; Andrieu-Abadie and Levade, 2002). By 

15 converting ceramide back to SM, plasma membrane-associated SM synthase 
may attenuate SMase-induced signalling. This reaction would produce as a 
side product DAG, which is a signalling molecule in its own right (Brose and 
Rosenmund, 2002). 

Hence, the presence of a SM synthase at the plasma membrane complicates a 
20 proper understanding of the role of ceramide as a specific signal-transducer. 

A current bottleneck for elucidating the role of SM hydrolysis in cellular 
signalling is the unambiguous identification of the agonist-stimulated SMase. 
Both neutral and acid SMases have been implicated, but their precise cellular 
roles in SM hydrolysis and signalling remain to be clarified (Andrieu-Abadie 
25 and Levade, 2002). Since the plasma membrane-associated SM synthase is 
capable of catalysing the reverse reaction of SM synthesis (van Helvoort et al., 
1994), it is tempting to speculate that it may execute part of the signalling 
events currently attributed to ligand-induced SMase. The observation that 
both plasma membrane- and Golgi-associated SM synthases exhibit reverse 
30 activity brings up the question of how the directionality of these enzymes is 
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regulated. One possibility is that the direction of the reaction is primarily 
determined by the relative concentrations of ceramide and DAG in the 
membrane. On the other hand, given the biological importance attributed to 
DAG and ceramide, one may expect the interconversion of these molecules to 
5 be subjected to a more elaborate form of control. Indeed, it has been shown 
that activation of SM synthesis in primary astrocytes is an early event 
associated with the mitogenic activity of basic fibroblast growth factor (Riboni 
et al., 2001) and that the onset of TNFa-induced apoptosis in 
rhabdomyosarcoma cells is preceded by TNF-dependent inhibition of SM 

10 synthesis (Bourteele et al., 1998). Mammalian SMS1 contains a predicted SAM 
(sterile alpha motif) domain at the amino-terminus (residues 7-63 in human 
SMS1) that might provide a means for the enzyme to interact with regulatory 
proteins (Schultz et al., 1997). The present identification of a family of animal 
SM synthases offers unprecedented opportunities to further clarify the 

15 biological significance of SM metabolism and its regulation. 

SM synthases contain four highly conserved sequence motifs (figure 8). 
Two of these, D3 and D4, are similar to the C2 and C3 phosphatase domains in 
LPPs and include the putative catalytic histidine and aspartate residues 

20 implicated in LPP-mediated hydrolysis of lipid phosphate esters (Neuwald, 
1997) This suggests a working mechanism for SM synthases in which the 
catalytic triad previously described for LPPs would function in a similar 
fashion. The reaction would proceed via the following steps: (1) binding of a 
two-chain choline-phosphohpid (PC or SM) to a unique binding site, (2) 

25 nucleophilic attack on the lipid-phosphate ester bond by the histidine in D4 
assisted by the conserved aspartate in this motif, (3) formation of a 
cholinephospho-histidine intermediate and release of DAG or ceramide, 
facilitated by the histidine in D3 acting as a base, (4) nucleophilic attack of the 
Cl-hydroxyl of ceramide or DAG on the cholinephospho-histidine intermediate, 

30 assisted by the histidine in D3, (5) release of SM or PC from the active site to 
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allow another round of catalysis. This model provides a framework for site- 
directed mutagenesis and kinetic studies to reveal the working mechanism of 
SM synthases. 

There is a striking parallel between animal SM synthase and IPC 
synthase in yeast. Like human SMSl, the yeast enzyme is Golgi localised and 
contains the key D4 motif in the Golgi lumen at the start of the last membrane 
span (Levine et al., 2000). This suggests that animal SM synthase and yeast 
IPC synthase may have evolved from a common ancestor. In contrast, the SM 
synthase recently identified from Pseudomonas aeruginosa (Luberto et al, 
2003) lacks the key motifs of animal SM synthase and is a soluble, rather than 
an integral membrane protein. Hence, it appears that SM synthases arose at 
least twice during evolution. 

Database searches using the catalytic site sequences from LPPs yielded, 
in addition to SM synthases, numerous other proteins containing partially 
conserved phosphatase motifs. In many cases, the biochemical functions of 
these proteins are not known. 

Given the present findings, the possibility that some of these LPP-like 
proteins do not serve as phospholipases but instead catalyse novel kinds of 
synthetic or transphosphatidylation reactions deserves consideration. For 
example, mammals and insects synthesize ethanolamine phosphorylceramide, 
a sphingolipid analogous to SM 

(Malgat et al., 1986; Rietveld et al., 1999). Whether production of EPC and SM 
involves a single transferase that accepts both PC and PE as substrates or 
requires two separate enzymes has remained an open issue. However, 
Drosophila lacks SM and generates only EPC (Rietveld et al., 1999). Our 
finding that Drosophila contains an SMS-related protein, but no SMS proteins, 
strongly suggests that the SMS-related proteins described in this study 
function as dedicated EPC synthases. This possibility is currently under 
further investigation. 
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Phosphorylcholine (PC), a small haptenic molecule, is found in a wide 
variety of prokaryotic organisms, i. e. bacteria, and in eukaryotic parasites 
such as nematodes, as well as in fungi. Linked to parasite-specific glycoprotein 
glycans or glycolipids, it is assumed to be responsible for a variety of 

5 immunological effects, including invasion mechanisms and long-term 

persistence of parasites within the host. Numerous reports have indicated 
various effects of PC-substituted molecules derived from parasitic nematodes 
on signal transduction pathways in B and T lymphocytes, displaying a highly 
adapted and profound modulation of the immune system by these parasites. 

10 The Nematoda, comprising parasitic and free-living species (e.g. C. 

elegans), can be regarded as promising prototypic systems for structural 
analyses, immunological studies and biosynthetic investigations. In this 
context, Ascaris suum, the pig parasitic nematode, is an ideal organism for 
immunological studies and an excellent source for obtaining large amounts of 

15 PC-substituted (macro)molecules. C. elegans, as a completely genome- 
sequenced species and expressing parasite analogous PC-substituted 
structures, together with the possibility for easy in vitro cultivation, represents 
a conceptual model for biosynthetic studies, whereas filarial parasites 
represent important model systems for human pathogens, especially in 

20 developing countries. Current knowledge on the tissue-specific expression of 
PC epitopes, structural data of glycoprotein glycans and glycosphingolipids 
bearing this substituent and biological implications for the immune systems of 
the respective hosts are reviewed by Lochnit et al., 2000. Since the elucidated 
structures of these compounds are clearly different from those present in the 

25 host, the biosynthesis of nematode-specific, PC-substituted compounds would 
be a promising target for the development of new anthelmintic strategies. 
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In the present invention it has been shown that modulating the activity 
of an enzyme of the group of enzymes identified as sphingomyelin synthases, 
ethanolamine phosphorylceramide synthases, 
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phosphatidylcholine: glycoprotein cholinephosphotransferase and 
phosphatidylcholine:glycolipid cholinephosphotransferase (further referred to 
as the group of enzymes) in a cell has effects on cell growth and/or survival. By 
lowering the activity in cells, the cells respond by a reduction in cell growth, 
5 increased sensitivity toward apoptotic stimuli and/or reduced cell survival. 
Upregulation of the activity induces cell growth, rejuvenation of the cells 
and/or a reduced sensitivity toward apoptotic signals. An effect is, of course 
preferably relative to the behaviour of normal cells. These cells typically have 
effective but not saturating levels of an activity of an enzyme of the group of 

10 enzymes and preferably of a sphingomyelin synthase. The possibility of 
detecting a change in the activity of an enzyme of the group, that can be 
correlated to a particular amino acid sequence has led the inventors to 
methods for determining whether a compound is capable of modulating an 
enzymatic activity displayed by a cell, said activity comprising an activity of an 

15 enzyme of the group of enzymes identified as sphingomyelin synthases, 
ethanolamine phosphorylceramide synthases, 
phosphatidylcholine: glycoprotein cholinephosphotransferase and 
phosphatidylcholine:glycolipid cholinephosphotransferase, said method 
comprising providing said cell with a nucleic acid encoding a polypeptide 

20 according to the invention, contacting said cell with said compound and 
determining whether said enzymatic activity is modulated. In a preferred 
embodiment said method comprises detecting whether said activity is 
inhibited. Thus the invention further provides the use of an inhibitor of a 
enzyme activity of the group, preferably a sphingomyelin synthase of the 

25 invention, as a cell death promoter. Preferably said cell is a parasitic cell or a 
mammalian tumor cell, preferably a human tumor cell. The compound capable 
of modulating said enzyme activity displayed by the cell, can act on the level of 
expression of the enzyme, wherein a higher expression results in upregulation 
of the enzyme activity displayed by the cell and wherein a lower expression 

30 results in downregulation of the enzyme activity displayed by the cell. 
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Alternatively, the compound can act on the activity of the enzyme in the cell, 
i.e. the compound can increase the activity of the enzyme and thereby 
upregulate the enzymatic activity displayed by the cell, or the compound can 
decrease the activity of the enzyme and thereby downregulate the enzymatic 
5 activity displayed by the cell. 

In a preferred embodiment of a method of the invention, said enzyme activity 
of the group comprises a sphingomyelin synthase activity. A polypeptide of the 
invention comprising sphingomyelin synthase activity preferably comprises an 

10 amino acid sequence having at least 80% and preferably at least 90% identity 
to an amino acid sequence according to formula (a), (b) or (c). In a preferred 
embodiment said polypeptide of the invention comprising sphingomyelin 
synthase activity preferably comprises an amino acid sequence having at least 
80% and preferably at least 90% identity to an amino acid sequence according 

15 to formula (a), (b) and (c). In a particularly preferred embodiment a 

polypeptide of the invention comprising sphingomyelin synthase activity in a 
mammalian cell (a mammalian sphingomyelin synthase) and having the above 
mentioned identity to the formulas, further comprises at least 50% and 
preferably at least 90% sequence identity with huSMSl (Seq. ID 12) or 

20 huSMS2 (Seq. ID 13) depicted in figure 8. In a particularly preferred 

embodiment a polypeptide of the invention comprising sphingomyelin synthase 
activity in a nematode cell (a nematode sphingomyelin synthase) and having 
the above mentioned identity to the formulas, further comprises at least 50% 
and preferably at least 90% sequence identity with ceSMSl (Seq. ID 14), 

25 ceSMS2 (Seq. ID 15) or ceSMS2 (Seq. ID 16) depicted in figure 8. A 

polypeptide of the invention comprising sphingomyelin synthase activity in a 
Plasmodium cell (a Plasmodium sphingomyelin synthase) and having the 
above mentioned identity to the formulas, preferably further comprises at least 
30% and preferably at least 90% sequence identity with pfSMSl (Seq. ID 17) 

30 or pfSMS2 (Seq. ID 18) depicted in figure 8. In a particularly preferred 
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embodiment a polypeptide of the invention comprising ethanolamin 
phosphorylceramid synthase activity in a mammalian cell (a mammalian 
ethanolamin phosphorylceramid synthase) and having the above mentioned 
identity to the formulas, further comprises at least 50% and preferably at least 
5 90%, sequence identity with huSMSr (Seq. ID 19). In a particularly preferred 
embodiment a polypeptide of the invention comprising ethanolamin 
phosphorylceramid synthase activity in a nematode cell (a nematode 
ethanolamin phosphorylceramid synthase) and having the above mentioned 
identity to the formulas, further comprises at least 50% and preferably at least 

10 90% sequence identity with ceSMSr (Seq. ID 21) depicted in figure 8, A 
polypeptide of the invention comprising ethanolamin phosphorylceramid 
synthase activity in an insect cell (an insect ethanolamin phosphorylceramid 
synthase) and having the above mentioned identity to the formulas, preferably 
further comprises at least 50% and preferably at least 90% sequence identity 

15 with dmSMSr (Seq. ID 20) depicted in figure 8. A polypeptide of the invention 
comprising PC:glycolipid/glycoprotein cholinephosphotransferase activity in a 
nematode cell (an nematode PCrglycolipid/glycoprotein 
cholinephosphotransferase) and having the above mentioned identity to the 
formulas, preferably further comprises at least 30% and preferably at least 

20 90%, sequence identity with ceSMSdr (Seq. ID 22) depicted in figure 8. 

In one embodiment of the invention a polypeptide of the invention 
comprising sphingomyelin synthase activity preferably comprises an amino 
acid sequence having at least 80% and preferably at least 90% identity to an 
amino acid sequence according to formula (a), (b) or (c). In a preferred 

25 embodiment said polypeptide of the invention comprising sphingomyelin 

synthase activity preferably comprises an amino acid sequence having at least 
80% and preferably at least 90% identity to an amino acid sequence according 
to formula (a), (b) and (c). In a particularly preferred embodiment a 
mammalian polypeptide of the invention comprising sphingomyelin synthase 

30 activity and having the above mentioned identity to the formulas further 
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comprises at least 50% and preferably at least 90%, sequence identity with 
huSMSl (Seq. ID 12) or huSMS2 (Seq. ID 13) depicted in figure 8. 

By providing a cell with a polypeptide of the invention having an 
5 enzyme activity of the group one produces a cell having more activity of said 
enzyme than before. This allows for early and easier detection of the effect that 
compounds have on the activity displayed by the cell. In a preferred 
embodiment a polypeptide of the invention having activity of an enzyme of the 
group is provided to a cell that did not contain this activity in detectable 

10 amounts. Such cells may be generated by inactivating the endogenous genes 
coding for the polypeptide or cells can be used from species and/or genera that 
do not contain this activity. Non-limiting examples of such species and/or 
genera in case of sphingomyelin synthase activity are: drosophila and yeast. 
Thus in a preferred embodiment the method ulitizes a cell that was deficient in 

15 an enzyme activity of the group, which is preferably a sphingomyelin synthase 
activity, prior to providing the cell with the polypeptide of the invention. 

A cell can be any type of cell. A cell can be part of an organism, where one or 
more of the cells of the organism are provided with (nucleic acid encoding) said 
20 polypeptide. The cell can also be a cell of a micro-organism. The micro- 
organism can be bacterial and is preferably a eukaryotic micro-organism. 
Yeasts and fungal micro-organisms are preferred. 

A polypeptide of the invention preferably comprises a sequence as depicted in 
25 figure 8 or a functional part, derivative and/or homologue thereof. A functional 
part, derivative or homologue of a sequence as depicted in figure 8 comprises 
at least the same enzyme activity in kind not necessarily in amount. A 
functional part preferably comprises at least 80% of the amino sequence of a 
sequence as depicted in figure 8. A functional derivative preferably comprises 
30 not more than 5 and preferably not more than 3 amino acid substitutions 
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compared to a sequence of figure 8. In a preferred embodiment said 
substitutions are conservative substitutions. A derivative can also comprises 
one or more deletions and/or insertions of amino acids when compared with a 
sequence as depicted in figure 8. A functional homologue is a sequence derived 
5 from a different species as the species depicted in figure 8, but wherein said 
homologue shares a common ancestor. A functional homologue preferably 
comprises an amino acid sequence of formula (a), (b) and (c). In a preferred 
embodiment said polypeptide comprises a sequence as depicted in figure 8A or 
a functional part, derivative and/or homologue thereof. In one preferred aspect 
10 of the invention said polypeptide is derived from a Plasmodium. Preferably 
having an amino acid sequence as depicted in figure 8B or a functional part, 
derivative and/or homologue thereof. 

The compound can be any chemical or biological compound. In one embodiment 
15 said compound has an effect on expression of a polypeptide of the invention in 
a cell. Such a compound can be an expression cassette comprises an 
expressible coding region for said polypeptide. It can also comprise for instance 
a compound capable of modulating the activity of a promoter for the gene 
encoding said polypeptide in said cell. In a preferred embodiment said 
20 compound is capable of preventing translation of mRNA encoding said 

polypeptide in said cell. Translation is preferably inhibited through anti-sense 
RNA or RNAi. In a particularly preferred embodiment translation is inhibited 
through RNAi. RNAi is a collective term for a response of cells to double 
stranded RNA. A variety of different types of compounds are able to induce the 
25 specific degradation of mRNAs. None of those different types of compounds are 
herein excluded. In a preferred embodiment the compound comprises siRNA 
(Elbashir et al., Nature 411, 494-498) or a hairpin RNA. A hairpin RNA can be 
introduced directly into the cell or be expressed by a gene encoding the 
hairpin. 



30 



WO 2005/003336 



PCT/NL2004/000488 



30 

In another aspect the invention provides the use of a nucleic acid 
encoding a polypeptide according to the invention, as a probe. In a preferred 
aspect the invention provides the use of an oligonucleotide specific for a nucleic 
acid sequence encoding a polypeptide as depicted in figure 8 or a functional 
5 part, derivative and/or homologue thereof, for detecting said sequence. 

Detection of the sequence can be for any purpose and is preferably for 
the purpose of assessing whether a cell comprises an enzyme activity of the 
group of enzymes mentioned above, preferably sphingomyelin synthase 
activity. 

10 

In another aspect the invention provides use of a nucleic acid encoding a 
polypeptide of the invention, preferably comprising a nucleic acid sequence 
encoding a polypeptide as depicted in figure 8 or a functional part, derivative 

15 and/or homologue thereof for enhancing cell survival and/or cell growth. It has 
been found that increasing the enzymatic activity displayed by a cell of an 
enzyme of the group of enzymes, has the effect that the secretion pathway of 
the cell is stimulated. This in yet another aspect the invention provides a 
method for at least in part improving the yield of an secretion product of a cell 

20 comprising providing said cell with a polypeptide of the invention, or a nucleic 
acid such a polypeptide, preferably comprising a nucleic acid sequence 
encoding a polypeptide as depicted in figure 8 or a functional part, derivative 
and/or homologue thereof. In a preferred embodiment said polypeptide 
comprises sphingomylin synthase activity. In this embodiment said cell is 

25 preferably a mammalian cell or a cell of a eukaryotic micro-organism. 

A method for determining whether a compound is capable of modulating 
an enzymatic activity displayed by a cell, said activity comprising an activity of 
an enzyme of the group of enzymes identified as sphingomyelin synthases, 
30 ethanolamine phosphorylceramide synthases, 



WO 2005/003336 



PCT/NL2004/000488 



31 

phosphatidylcholinerglycoprotein cholinephosphotransferase and 
phosphatidylcholine:glycolipid cholinephosphotransferase can be performed 
using any method for the detection of said enzymatic activity. In a preferred 
embodiment the method for detecting said enzyme activity comprises 
5 providing said cell or a fraction thereof with a labelled substrate for said 

enzyme. Said substrate is preferably a substrate for sphingomyelin synthase. 
The method preferably further comprises harvesting sphingolipid from said 
cell or said fraction and detecting labelled sphingolipid. Detection of said 
labelled sphingolipid is preferably done using (thin layer) chromatography or 

10 mass spectrometry. 

Polypeptides of the invention are associated with a memebrane of the 
cell. The localization varies with the particular polypeptide. It has been 
observed that this property can be used to generate chimearic polypeptides 
having sequences from at least two polypeptides of the invention which have 

15 an altered distribution in the cell when compared to at least one of the 
polypeptides said chimearic polypeptide is derived from. Such chimearic 
polypeptides and nucleic acids encoding the chimearic polypeptides are 
therefore also included in the invention. Thus the invention provides a method 
for targeting a first polypeptide according to the invention to a different 

20 cellular compartment comprising providing a cytosolic part of said first 

polypeptide with a cellular compartment localization signal of a cytosolic part 
of a second polypeptide according to the invention, wherein said first and said 
second polypeptide, when unmodified, reside in different cellular 
compartments. Chimearic polypeptides wherein the at least two original 

25 polypeptides reside in essentially the same cellular compartment are also 
included in the invention. It is also possible to attach known cellular 
localization signals to a polypeptide of the invention. Such known cellular 
localization signals, for instance, an endoplasmatic reticulum retention signal 
do not have to be derived from a polypeptide of the invention. The cellular 

30 localization signal is preferably present in the cytosolic C-terminal end of a 
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polypeptide of the invention. Thus preferably the cytosolic C-terminal end of 
said first polypeptide is provided with the with a cellular compartment 
localization signal of a cytosolic part of the second polypeptide. Preferably said 
cytosolic part of said second polypeptide comprises the C-terminal end of a 
5 polypeptide of the invention. Another way to target a polypeptide of the 
invention to a different compartment is to remove the cellular localization 
signal from the cytosolic part (preferably the C-terminal cytosolic part) of the 
polypeptide. 

The polypeptide can be targeted to any cellular compartment having a 

10 membrane and be active there. In a preferred embodiment said cellular 

compartment comprises a plasma membrane, an endosomal compartment, a 
Golgi, an endoplasmatic reticulum or a combination thereof. Cellular 
localization signals do not have be absolute in that there can be cases wherein 
detectable amounts of polypeptide are spread over at least two cellular 

15 compartments. A polypeptide is said to be targeted to a different compartment 
when, compared to the original cellular localization of one of the polypeptides 
of the invention, the chimearic polypeptide is detectable in at least on other 
compartment or is not detectable in a compartment where one of the single 
polypeptides is detectable. A special case of targeting is where not the 

20 localization but rather the relative distribution over compartments changes. 
For instance, when an percentage X of a polypeptide of the invention is 
normally present in a first compartment and a percentage Y is present in a 
second compartment. In this example targeting is said to have taken place 
when the percentage in at least one of said at least two compartments is 

25 altered by more than 30%. Preferably, the change is reciprocal. 

Considering that the cellular compartment localization signal of a 
polypeptide of the invention is predominantly determined by a C-terminal 
cytosolic part of said polypeptide it is preferred that said cellular compartment 
localization signal of the C-terminal cytosolic part of said second polypeptide 

30 replace the C-terminal cytosolic part of said first polypeptide. 
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It will be clear to the skilled in the art that by the present invention 
yielding purified cholinephosphotransferases and 
ethanolaminephosphotransferases a new field of development for new 
5 compounds with valuable properties as for instance drugs has been made 
available. The present invention will further be elucidated by the following 
examples without in any way restricting the broad scope thereof. 

Materials and methods 

10 Chemicals 

Sphingosyl-(NBD-hexanoyl)-phosphocholine (NBD-SM) and (NBD- 
hexanoyl)-ceramide (NBD-Cer) were from Molecular Probes (Eugene, OR) and 
01eoyl-(NBD-hexanoyl)-phosphocholine (NBD-PC) from Avanti Polar Lipids 
(Alabaster, AL). [methyl- 14 C]-choline was from ICN Biomedicals (Irvine, CA) 

15 and L-3-phosphatidyl [N-methyl- 3 H]«choline,l,2-.dipalmitoyl from Amersham 
Pharmacia (Piscataway, NJ). NBD-DAG was produced from NBD-PC by 
treatment with phospholipase C from B. cereus as described (Trotter, 2000). 
Egg L-a-phosphatidylcholine, lyso-phosphatidylcholine, L-a- 
phosphatidylethanolamine, L-ct-phosphatidylglycerol, L-a-phosphatidylserine, 

20 soybean L-oc-phosphatidylinositol, dioctanoyl L-ot-phosphatidic acid, SM (from 
bovine brain), choline, phosphocholine, cytidine 5'-diphosphocholine, O- 
tricyclo[5.2.1.0 2 ' 6 ]dec-9-yl dithiocarbonate potassium salt (D609), N- 
ethylmaleimide (NEM) and phosphohpase C (Bacillus cereus) were from Sigma 
Aldrich (St. Louis, MO). 



25 
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Selection, cloning and expression of CSS sequences 

Animal entries in the Swiss-Prot/TrEMBL protein sequence database 
were searched using Prosite (http://www.expasy.org/tools/scanprosite) against 
sequence motif H-[YFWH]-X 2 -D-[VLI]-X2-[GA]-X3-[GSTA] and then further 
5 selected based on three additional criteria as described under Results. 
Candidate SM synthases (CSS) were assembled into different groups by 
multiple sequence alignments and phylogenetic trees generated with ClustalW 
and PHYLIP. Open reading frames (ORFs) in selected CSS sequences were 
PCR amplified using Taq polymerase (MBI Fermentas, Hanover, MD) 

10 according to information provided in Table I. PCR products were cloned into 
yeast expression vector pYES2.1/V5-His-TOPO (Invitrogen Corporation, 
Carlsbad, CA) and the resulting plasmids used to transform S. cerevisiae 
strain IAY11 (MATa ura3-52 his3A200 leu2-3, -112 trpA901 ade2-101 ade3- 
A853, provided by Ian Adams, MRC-LMB, Cambridge, UK). Transformants 

15 were grown in complete minimal uracil dropout medium containing 2% (w/v) 
galactose. Expression of CSS proteins was verified by Western blot analysis 
with mouse anti-V5 antibodies (Invitrogen). PCR amplified ORFs of human 
SMS1 and SMS2 were cloned into mammalian expression vector 
pcDNA3.1/V5-His-TOPO (Invitrogen) and the resulting plasmids (SMSl- 

20 V5/pcDNA3. 1 and SMS2-V5/pcDNA3. 1) used to transfect HeLa cells. 

SM synthase assay on cell ly sates 

CSS-expressing yeast cells were lysed at 100 OD6oo/ml by bead bashing 
25 at 4 0 C in RBI buffer (120 mM K-glutamate, 15 mM KC1, 5 mM NaCl, 2 mM 
MnCl 2 , 2 mM MgCl 2 , 20 mM Hepes-KOH, pH 7.2) containing freshly added 
protease inhibitors (Holthuis et al, (1998). 

Lysates were centrifuged at 700 g for 10 min at 4 0 C and supernatants 
(0.25 ml per reaction) preincubated with NEM (1 mM) for 10 min at 37 0 C. The 
30 reaction was initiated by addition of 5 nmol of NBD-Cer in 0.1 ml RBI buffer 
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and incubation continued for 15 min. Lipids were extracted by addition of 1.7 
ml chloroform:methanol (1:2.2), dried under N2 and subjected to butanol/water 
partitioning. Lipids recovered from the butanol phase were separated by one- 
dimensional TLC for 30 min in solvent I: chloroform/methanol/4.2 M NH4OH 
(9:7:3, v/v), followed by 30 min in solvent II: chloroform/methanol/acetic acid 
(45:30:7, v/v). Fluorescent images were recorded on a STORM 860 Imaging 
Analysis System (Molecular Dynamics, Sunnyvale, CA) and analysed with 
ImageQuant software. 

Metabolic labelling 

CSS-expressing yeast cells (0.5 ODeoo) were inoculated in 5 ml medium 
containing 10 fiCi [methyl- l *C] -choline and 10 nmol NBD-Cer. Cells were 
grown for 16 h at 30 0 C, washed in water, and lipids were extracted by bead 
bashing in H20/methanol/chloroform (5:16:16, v/v). The organic extracts were 
dried under N2, subjected to butanol/water partitioning, and lipids recovered 
from the butanol phase were deacylated by mild base treatment using 0.2 N 
NaOH in methanol (60 min, 30 0 C). After neutralizing with 1 M acetic acid, 
lipids were extracted with chloroform and separated by two-dimensional TLC 
using solvent I in the first, and solvent II in the second dimension. Radio- 
labelled lipids were detected by exposure to BAS-MS imaging screens (Fuji 
Photo Film Co., Japan) and read out on a BIO-RAD Personal Molecular 
Imager. 

Preparation of detergent extracts 

SMSl/SMS2-expressing yeast cells were lysed at 100 ODeoo/ml by bead 
bashing at 4° C in lysis buffer (50 mM Tris-HCl, pH 7.0, 1 mM EDTA, 0.3 M 
sucrose) containing protease inhibitors. Post nuclear supernatants were 
prepared as above and loaded onto a 60% (w/w) sucrose cushion and then 
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centrifuged at 100.000 g for 60 min at 4 ' C. Membranes derived from ± 500 
ODeoo of cells were resuspended in 1 ml ice-cold extraction buffer (50 mM Tris- 
HC1, pH 7.0, 10% glycerol, 1% Triton X-100, 1 mM MnCl 2 ) containing protease 
inhibitors, incubated for 60 min at 4 0 C while rotating and then centrifuged at 
5 100.000 g for 60 min at 4 * C. Detergent extracts were diluted to a final protein 
concentration of 1.5 mg/ml in extraction buffer and 100 pi aliquots were snap 
frozen in liquid N2 and stored at -80 * C. 

SM synthase and reverse transferase assays on detergent extracts 

10 

Detergent extracts were diluted 5- to 8-fold in extraction buffer, and 50 
pi mixed with 5 pi of a 1.2 mM head group donor stock (e.g. CDP-choline, PC, 
PI) or with 15 pCi (350-700 fmol) of [ 3 H]-cholme-containing PC prepared in 
extraction buffer. Following a 15 min preincubation at 37° C, reactions were 
15 started by addition of 220 pi RB2 buffer (50 mM Tris-HCl, pH 7.0, 5 mM 
MgCl 2 , 1 mM MnCl 2 , 1 mM NEM) containing 5 nmol NBD-ceramide (SM 
synthase assay) or 5 nmol NBD-DAG (reverse transferase assay) and 
incubated for 60 min at 37 °C. Lipids were extracted, separated by TLC, and 
analysed as above. 

20 

Mass spectrometry 

Detergent extracts (400 pi) were mixed with bovine brain ceramide (180 
pM) and egg PC (500 pM) in a total volume of 500 pi. After 15 min at 37 0 C, 

25 reactions were diluted 4-fold in RB2 buffer and then incubated for another 60 
min at 37 0 C. Lipids were extracted in chloroform:methanol (1:2.2) and 
subjected to butanol/water partitioning. Residual Triton X-100 was removed by 
solid phase extraction on Si-60 columns (500 mg stationary phase) in acetone. 
Phospholipids were eluted in methanol/chloroform (19:1, v/v), dried under N 2 , 

30 dissolved in methanol/chloroform (2:1, v/v) and then separated by HPLC on 
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two 250 x 4.6 mm Lichrosphere RP-18 end-capped columns (Merck, 
Darmstadt, Germany) in series (Brouwers et al., 1998). The column effluent 
was split in a 10:1 ratio, with the smaller fraction going to the mass 
spectrometer. Mass spectrometry was performed on a Sciex API-365 triple 
5 quadrupole mass spectrometer (Applied Biosystems, Nieuwerkerk a/d IJssel, 
The Netherlands). Positive ions were generated by a turbo-ion spray ionisation 
source operating at +5.5 kV ionisation potential. N2 (2 1/min) was used as 
drying gas at a temperature of 350 0 C. The declustering potential (cone 
voltage) was set to 45V and the focus potential to 220V. Mass spectra from 
10 mass to charge ratio (m/z) 650 amu to 950 amu were recorded at a speed of 120 
amu/s. 

Mammalian cell transfection and immunofluorescence microscopy 

15 HeLa cells were cultured in DMEM medium containing 10% fetal calf 

serum. Cells were grown on glass coverslips to 40% confluence and then 
transfected with human SMSl-V5/pcDNA3.1, SMS2-V5/pcDNA3.1 and myc- 
tagged sialyltransferase/pCB7 constructs using Lipofectamine 2000 
(Invitrogen). After 48 h, cells were fixed in 3% paraformaldehyde/PBS and 

20 processed for immunofluoresence microscopy (see Supplementary materials for 
details). For cell surface biotinylation, cells were incubated twice with 0.5 
mg/ml sulfo-NHS-SS-biotin (Pierce) in PBS for 20 min on ice, quenched in 10 
mM glycine/PBS for 20 min and then washed in PBS prior to fixation. 
Immunostaining was with rabbit anti-V5 antibodies (Sigma), mouse anti-V5 

25 antibodies (Invitrogen), mouse 9E10 anti-myc antibody (SanverTech, CA), 

rabbit anti-biotin antibodies (Rockland, Gilberts ville, PA), FITC- or Texas Red- 
conjugated goat anti-rabbit and goat anti-mouse antibodies (Jackson 
Laboratories, West Grove, PA). Images were obtained using a Nikon D- eclipse 
Cl confocal miscroscope. 



30 
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Protease protection assay 

HeLa cells transfected with human SMS1-V5 or SMS2-V5 constructs 
were washed, scraped and then lysed in ice-cold PNS buffer (120 mM K- 
5 glutamate, 15 mM KC1, 5 mM NaCl, 2 mM MnCl 2 , 0.8 mM CaCl 2 , 2 mM 

MgCl 2 , 1.6 mM EDTA and 20 mM Hepes-KOH, pH 7.2) by 20 passages trough 
a 26 3 /4 gauge needle. The homogenate was treated with 8 mM trypsin (Sigma 
Aldrich) in the presence or absence of 0.4% Triton X-100 for 30 min at 37 e C. 
Alternatively, cells were only washed and then trypsin-treated in PNS buffer 
10 as above. Homogenates and cell suspensions were transferred on ice, trypsin 
inhibitor (Sigma Aldrich) was added to 160 mM and samples were processed 
for Western blot analysis using mouse anti-V5 antibodies and rabbit anti-p24 
protein (Gommel et al., 1999, provided by B. J. Helms, Faculty of Veterinary 
Medicine, Utrecht). 

15 

SDS-PAGE and Western Blot Analysis 

Protein extracts were prepared in SDS/Urea sample buffer (20 mM Tris- 
HC1, pH 6.8, 1 mM EDTA, 4.5 M urea, 2.5% SDS, 0.01% bromophenol blue) 
containing fresh protease inhibitors (Holthuis et al., (1998). Extracts were 
20 heated for 10 min at 50 ° C, centrifuged briefly at 14,000 g and then resolved by 
SDS-PAGE on 10% polyacrylamide mini-gels. For Western blotting, 
nitrocellulose transfers were blocked for 90 min in PBS, 5% Protifar (Nutricia, 
Zoetermeer, The Netherlands), 0.2% Tween-20 (Blotto). Primary antibody 
incubations were performed for 1 h in Blotto. Detection was with horsheradish 
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peroxidase-conjugated goat anti-rabbit or anti-mouse IgG, using enhanced 
chemiluminescence (Amersham Pharmacia Biotech). 
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Immunofluorescence Microscopy 
HeLa cells were grown on coverslips, transfected with human SMSl- 
V5/pcDNA3.1, SMS2-V5/pcDNA3.1 and /nyc-tagged sialyltransferase/pCB7 
(Sprong et al., 2001). 48 h after transfection, cells were fixed in 3% 
paraformaldehyde/PBS for 30 min at room temperature, quenched in PBS 
containing 50 mM NrkCl, and then blocked and permeabihzed for 1 h in PBS 
containing 0.5% BSA, 0.1% saponine (blocking buffer). Cells were labelled with 
mixtures of primary antibodies (1:200) for 1 h in blocking buffer, washed for 45 
min in blocking buffer with three buffer changes, and then incubated for 20 
min with 10% goat serum in blocking buffer. For immunostaining of intact 
cells, incubation with primary antibodies occurred before fixation and was 
performed for 1 h at 4° C in PBS containing 0.5% BSA. Cells were 
counterstained with fluorescently labelled secondary goat antibodies (1:75) for 
30 min in blocking buffer, washed in blocking buffer for 45 min with three 
buffer changes, and then rinsed briefly in PBS and water. Mounting was in 
Mowiol 4-88 (Calbiochem, La Jolla, CA) containing 2.5% 1,4- 
diazabicyclo[2.2.2.]octane (Sigma). 
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Example 1 

Identification of candidate SM synthases from the animal database 

Figure 1A shows an outline of the bioinformatics approach used to 
5 identify candidate sequences for SM synthases from the animal database. 
Candidate SM synthases were identified based on the following criteria: 1) 
presence of a sequence motif, H-[YFWH]-X 2 -D-[VLI]-X2-[GA]-X3-[GSTA], 
shared by previously characterised LPPs and Aurlp homologues; 2) 
biochemical function should be unknown; 3) no structural homologues in S. 

10 cerevisiae, since this organism lacks SM; 4) presence of multiple (>2) 

transmembrane domains, since the enzyme mechanism is intramembranous 
and because LPPs and Aurlp proteins have 6 predicted membrane spans. 
Sequences conforming to all 4 criteria were subsequently used as queries in a 
BLAST search to track down homologous sequences that were missed in the 

15 initial search due to deviations in the LPP/Aurlp motif. This approach yielded 
9 human, 9 mouse and 9 C. elegans sequences that could be grouped into three 
major protein families, designated CSS1, CSS2 and CSS3 (for candidate SM 
synthase family 1-3; Figure IB, C). Except for the presence of a common 
sequence motif, CSS proteins displayed no significant sequence similarity to 

20 Aurlp proteins or to LPP family members with known biochemical functions. 
However, human CSSl(3l is identical to PRG1, a neuron-specific candidate 
phosphatide acid phosphatase with a role in axon growth and regenerative 
sprouting (Brauer et aL, 2003) 

Database accession numbers of CSS proteins are listed in the legend of Figure 
25 1. 
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Example 2 

A subset of CSS3 family members displays SM synthase activity 

5 To investigate whether any of the three CSS families indeed contained 

SM synthases, the open reading frames of human, mouse and C. elegans 
members for which full-length cDNAs could be obtained were cloned into a 
yeast multicopy, GAL1 promoter plasmid in frame with a caxboxy-terminal V5 
epitope. The resulting plasmids were used to transform wild-type yeast and 

10 the transformants were shifted to galactose-containing medium to induce 
expression of recombinant proteins. Expression of proteins was verified by 
Western blot analysis using anti-V5 antibodies (Figure 2A). Thus, 13 of the 27 
selected CSS sequences were expressed and analysed for SM synthase activity 
(marked by asterisks, Figure 1C). To this end, yeast cells expressing CSS 

15 protein were lysed and incubated with fluorescent Ce-NBD-ceramide (NBD- 
Cer), a known substrate of mammalian SM synthase. Incubations were 
performed in the presence of N-ethylmaleimide (NEM), a potent inhibitor of 
yeast-associated SMases that does not affect mammalian SM synthase (Figure 
3). Reaction mixtures were next subjected to one-phase lipid extraction and the 

20 lipids separated by one-dimensional TLC. 

Figure 2B shows that in lysates of control cells NBD-Cer was converted 
exclusively into NBD-IPC. The same was true for lysates of cells expressing 
members of the CSS1 and CSS2 protein families. However, in lysates of cells 
expressing human CSS3ccl, human CSS3ot2, C. elegans CSS3al or C. elegans 

25 CSS3a2, NBD-Cer was converted to a second product with an Rf value 
identical to that of NBD-SM. The identity of this product as NBD-SM was 
confirmed by electrospray ionisation tandem mass spectrometry (ESI-MS/MS; 
data not shown). Moreover, metabolic labelling of cells expressing human 
CSS3ccl or CSS3a2 with [ 14 C]-choline in the presence of NBD-Cer resulted in 

30 the production of radio-labelled NBD-SM (Figure 2C). When NBD-Cer was 
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omitted, we noticed that CSS3oc2-expressing cells generated small amounts of 
[ 14 C]-choline-labelled lipids that were absent in control cells and distinct from 
PC and lysoPC. In addition, these lipids were resistant to mild alkaline 
hydrolysis, suggesting that they represent phytoceramide-based SMs 
5 (phytoSM; Figure 2D). Importantly, ESI-MS revealed that bovine brain 

ceramides incubated with detergent extracts of CSS3a2-expressing cells, but 
not control cells, are converted to SM (Figure 4). Together, these results 
indicate that CSS3cc proteins do not only use NBD-Cer, but also recognise 
naturally occurring ceramides as substrates for SM synthesis. 

10 Human CSS3al and CSS3a2 share 57% sequence identity and are 

highly conserved in mammals (human - mouse > 90%). Based on the results 
presented herein, we propose to rename these proteins SMSl and SMS2, 
respectively. C. elegans CSS3cd (ceSMSl) and CSS3a2 (ceSMS2) share 22-27% 
sequence identity with human SMSl and SMS2, and apparently represent 

15 their functional counterparts in the nematode (Figure 2B). C. elegans contains 
a third, CSS3a-related protein, termed ceCSS3a3. Although we have not 
tested this protein for SM synthase activity, the finding implies that C. elegans 
is equipped with three independent SM synthases (Figure 1C). BLAST 
searches for orthologous sequences identified two proteins in the human 

20 malaria parasite Plasmodium falciparum (PlasmoDB accession no. 

MAL6P1.177 and MAL6P1.178; see also below). Hence, a multiplicity of SM 
synthase genes appears a general feature of organisms generating SM. 

Apart from CSS3ct/SMS proteins, the CSS3 family contains a second 
cluster of CSS3P or SMS-related (SMSr) proteins with members in humans, 

25 mice, C. elegans, and Drosophila (Figure 1C). Human SMSr is highly 

conserved in mammals (human-mouse: 95%), shares > 40% sequence identity 
with its orthologues in C. elegans and Drosophila, and is about 34% identical 
to human SMSl and SMS2. Heterologous expression of human or C. elegans 
SMSr did not yield any detectable SM synthase activity (Figure 2B, right 

30 panel). The C. elegans CSS3y protein, renamed SMSdr, is only distantly 



WO 2005/003336 



PCTYNL2004/000488 



44 

related to SMS and SMSr proteins (<22% identical; see also Figure 1C) and its 
ability to mediate SM synthesis was not tested. 

Example 3 

5 Human SMS1 and SMS2 function as PCrceramide cholinephosphotransferases 

with reverse activity 

SM synthesis in anim als proceeds by the liberation of phosphorylcholine from 
PC and its subsequent transfer onto the primary hydroxyl of ceramide. To 
10 establish whether human SMS1 and SMS2 function as PCrceramide 

cholinephosphotransferases, their enzymatic characteristics were analysed in 
more detail. Hence, different donors of choline-P were tested as substrates for 
SM synthesis. To this end, detergent extracts of cells expressing human SMS1 
or SMS2 were incubated with NBD-Cer and the formation of NBD-SM was 
15 monitored by TLC. A dilution of extracts proved necessary to render SMS- 
mediated synthesis of NBD-SM dependent on externally added head group 
donors (Figure 5). Under these conditions, free phosphorylcholine (Ch-P) and 
CDP-choline (CDP-Ch) did not support SMSl- or SMS2-mediated SM 
synthesis (Figure 6A), PC, on the other hand, was efficiently recognised as 
20 substrate. SM itself was also used as a donor of the phosphorylcholine group. 
Lyso-PC was a very poor substrate. These results suggest that human SMSl 
and SMS2 are transferases that require two fatty chains on the choline-P 
donor molecule in order to be recognised efficiently as a substrate. 

25 Example 4 

The reaction of human SMSl or SMS2 with non-choline phospholipids 

We next investigated the possibility that human SMSl or SMS2 would 
use non-choline phospholipids as substrates. None of the phospholipids tested 
30 (PE, PI, PS, PA, PG) other than PC supported SM formation (Figure 6A). 
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Moreover, when extracts of cells expressing SMS2 were incubated with PC 
containing [ 3 H]-choline, the formation of radiolabelled NBD-SM was observed 
(Figure 6B). Hence, it appears that SMS1 and SMS2 directly and specifically 
recognise the choline head group on their substrates. 

5 

Example 5 

The reaction of sphingomyelin and diacyl glycerol 

Previous work suggested that mammalian SM synthase is also capable 

10 of catalysing the reverse reaction, namely the formation of PC from SM and 
DAG (Marggraf and Kanfer, 1984; van Helvoort et al., 1994). 

To investigate whether this was also the case for human SMS1 and 
SMS2, extracts of cells expressing these proteins were incubated with NBD- 
DAG and the formation of NBD-PC monitored by TLC. Addition of SM induced 

15 NBD-PC formation in extracts of SMS1- or SMS2-expressing cells, but not in 
control cell extracts (Figure 7). Strikingly, PC itself proved more efficient than 
SM in stimulating SMS1- or SMS2-dependent NBD-PC formation. In contrast, 
addition of non-choline phospholipids (e.g. PI) had no effect. These data 
suggest that human SMS1 and SMS2, rather than functioning strictly as SM 

20 synthases, are transferases capable of using PC and SM as phosphocholine 
donors to produce PC or SM, dependent on the relative concentrations of DAG 
and ceramide as phosphocholine acceptors, respectively. 

Mammalian SM synthase was found to be sensitive to the bacterial PC- 
phospholipase C inhibitor, D609 (Luberto and Hannun, 1998). 

25 In lysates of SMS 1- or SMS2-expressing yeast cells, D609 inhibited SM 

synthesis in a dose-dependent manner. The extent of inhibition observed for 
SMSl-mediated SM synthesis (50% at 100 g/ml D609) was comparable to that 
reported for mammalian SM synthase (Luberto and Hannun, 1998). SMS2 
proved two-fold less sensitive to the drug. 
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Hence, human SMS1, SMS2 and the mammalian SM synthase activity 
described in the literature share many enzymatic characteristics. 

Example 6 

5 Structure and expression of human SMS1 and SMS2 

Figure 8A shows a sequence alignment of human SMSl, SMS2 and 
SMSr, Hydrophobicity analysis predicted six membrane-spanning alpha 
helices connected by hydrophilic regions that would form extramembrane 

10 loops. A comparative analysis with SMS and SMSr sequences from mice, C. 
elegans, P. falciparum and Drosophila revealed that the number and spacing 
properties of transmembrane helices are well conserved. In addition, SMS 
proteins contain four highly conserved sequence motifs, designated Dl, D2, D3 
and D4 (Figure 8A and B, residues highlighted in blue). Motifs D3 (C-G-D-X3- 

15 S-G-H-T) and D4 (H-Y-T-X-B-V-Xz-Y-Xe-F-X 2 -Y-H) are similar to the C2 and 
C3 motifs in LPPs (shared residues highlighted in red) and include the 
histidine and aspartate residues (underlined) that form a catalytic triad 
mediating the nucleophilic attack on the lipid phosphate ester bond Neuwald 
(1997). 

20 As in LPPs, these residues are juxtaposed to transmembrane segments 

4 and 6 of the SMS proteins and consequently would be oriented toward the 
same side of the membrane. This would suggest that motifs D3 and D4 are 
part of the catalytic site responsible for liberating cholinephosphate from PC 
during SM synthesis. Motifs Dl (P-L-P-D) and D2 (R-R-X 8 -Y-X2-R-X 6 -T), on the 

25 other hand, appear entirely unique to SMS proteins and are located in the first 
extramembrane loop and third transmembrane helix, respectively (Figure 8A). 
The SMSr proteins found in humans, mouse, Drosophila and C. elegans each 
contain exact copies of the Dl and D3 motifs, yet exhibit one or more conserved 



WO 2005/003336 



PCT/NL2004/000488 



47 

amino acid substitutions in motifs D2 and D4 (Figure 8A and B, residues 
highlighted in white on grey. 

Previous work showed that some members of the mammalian LPP 
superfamily are expressed in only a limited set of tissues (Waggoner et al., 
5 1999, Brauer et al, 1997). 

Since differences in tissue distribution would provide a possible explanation 
for the existence of two different SM synthase isoforms in mammals, we 
investigated the expression profiles of human SMSl and SMS2. As shown in 
Figure 10, Northern blot analysis detected the presence of a low abundant 3.8 
10 kb transcript for SMSl in human brain, heart, kidney, liver, muscle and 

stomach. A major 1.9 kb transcript for SMS2 was expressed to a similar level 
in all of the above human tissues. These results suggest that human SMSl and 
SMS2 are encoded by ubiquitously expressed genes. 



Example 7 

Subcellular localisation and membrane topology 

SM synthesis occurs in the Golgi complex as well as at the plasma 
membrane of mammalian cells. Endosomes have been put forward as another 
major site of SM synthesis (Kallen et al., 1994) but this has been disputed (van 
Helvoort et al., 1994). 

It is not known whether SM synthase activity detected at these locations 
is due to the presence of more than one iso-enzyme in the cell. This led us to 
examine the subcellular distribution of V5-tagged versions of human SMSl 
and SMS2 in transfected HeLa cells using immunofluorescence microscopy. As 
shown in Figure 11A, SMSl was concentrated in the perinuclear region where 
it displayed extensive colocalisation with sialyltransferase, a marker of trans 
Golgi cisternae. This colocalisation was also observed in cells treated with 
nocodazole, a drug causing fragmentation of the Golgi by disrupting the 
microtubular network, hence confirming the association of SMSl with the 
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Golgi apparatus. SMS2 displayed a different localisation pattern and was 
primarily concentrated at the plasma membrane (Figure 11B). A portion of 
SMS2 was also found in the perinuclear region where it colocalised with 
sialyltransferase (see Figure 12). This Golgi-associated pool of SMS2 unlikely 
5 represents newly synthesized material en route to the cell surface, since it was 
also observed in cells after a 4-hour chase with cycloheximide. There was no 
substantial colocalisation of SMS2 with markers of the endosomal/lysosomal 
system (e.g. EEA1, CD63, internalised transferrin; data not shown). Whether 
SMS2 cycles between the Golgi and the plasma membrane, and thereby passes 
10 through endosomes remains to be established. 

Since SM synthesis takes place in the exoplasmic leaflet of the Golgi and 
the plasma membrane (Futerman et al., 1990; van Helvoort et al., 1994), the 
putative catalytic residues in motifs D3 and D4 of SMSl and SMS2 would be 
oriented toward the Golgi lumen and cell surface, respectively, whereas their 
15 termini would be located on the opposite, cytosolic side of the membrane 

(Figure 13A). To test this prediction, we investigated the sidedness of the V5- 
tagged COOH termini of human SMSl and SMS2 by protease protection 
analysis. When HeLa cells transfected with the SMS2-V5associated type I 
membrane protein, p24 construct were trypsinised, the V5 tag remained 
20 intact, unless cells were lysed or treated with detergent prior to incubation 
with the protease (Figure 13B). Trypsinisation of lysates from SMS1-V5 
expressing cells in the absence of detergent resulted in a complete removal of 
the V5 tag. Under these conditions, the Golgi-associated type I membrane 
protein, p24 (Gommel et al., 1999). 

was largely protected. Collectively, these results indicate that the COOH 
termini of SMSl and SMS2 are cytosolic. Consequently, the putative active 
site residues in these proteins would be positioned on the exoplasmic leaflet 
(Figure 13A), hence where SM synthesis was found to occur. Together, these 
findings demonstrate yet another level of similarity between SMSl, SMS2 and 
the SM synthase activity previously described in mammalian cells. 
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FIG. 9 

Sequence listings 

Sequence listings 1-11: conserved sequence motifs of phosphotransferases as indicated i: 

figure 8 B; 

sequence listing 1: partial polypeptide sequences of hu SMS1 
sequence listing 2: partial polypeptide sequences of hu SMS2 
sequence listing 3: partial polypeptide sequences of ce SMS 1 
sequence listing 4: partial polypeptide sequences of ce SMS2 
sequence listing 5: partial polypeptide sequences of ce SMS3 
sequence listing 6: partial polypeptide sequences of/?/SMS 1 
sequence listing 7: partial polypeptide sequences ofp/SMS2 
sequence listing 8: partial polypeptide sequences of hu SMSr 
sequence listing 9: partial polypeptide sequences of dm SMSr 
sequence listing 1 0: partial polypeptide sequences of ce SMSr 
sequence listing 1 1 : partial polypeptide sequences of ce SMSdr 



Sequence listing 12: /zwSMSl 



MKEVVYWSPKKVADWLLENAMPEYCEPLEHFTGQDLINLTQEDFKKPPLCRVSSDN 

GQRLLDMIETLKMEHHLEAHKNGHANGHLNIGVDIPTPDGSFSIKIKPNGMPNGYRK 

EMIKIPMPELERSQYPMEWGKTFLAFLYALSCFVLTTVMISVVHERVPPKEVQPPLPD 

TFFDHFNRVQWAFSICEINGMlLVGLWLIQWLLLKYKSIISRRFFCrVGTLYLYRCITM 

YVmPVPGMHFNCSPKXFGDWEAQLRRIMKUAGGGLSITGSHNMCGDYLYSGHTV 

MLTLTYLFIKEYSPRRLWWYHWICWLLSVVGIFCILLAHDHYTVDVVVAYYITTRIJF 

WWYHTMANQQVLKEASQMNLLARVWWYRPFQYFEKMVQGIVPRSYHWPFPWPV 
VHLSRQVKYSRLVNDT 
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Sequence listing 13: /7«SMS2 

•MDIIETAKLEEHLENQPSDPTNTYARPAEPVEEENKNGNGKPKSLSSGLRKGTKKYPD 

YIQlAMPTESRNKFPLEWWKTGlAFIYAVFNLVLTTVMrrVVHERVPPKELSPPLPDKF 

FDYIDRVKWAFSVSEINGIILVGLWITQWLFLRYKSIVGRRFCFIIGTLYLYRCITMYVT 

TLPVPGMHFQCAPKLNGDSQAKVQRILRLISGGGLSITGSHILCGDFLFSGHTVTLTLT 

YLFIKEYSPRHFWWYHLICWLLSAAGUCILVAHEHYTIDVIIAYYITTRLFWWYHSMA 

NEKNLKVSSQTNFLSRAWWFPIFYFFEKNVQGSIPCCFSWPLSWPPGCFKSSCKKYSR 
VQKIGEDNEKST 



Sequence listing 14: ceSMSl 

MKMSWNHQYTNYGSIADDNGDEEKAENSEGAAAEKVEKQHDDDGVVVHEETDGV 
ASSRSSHHDKQKPGETKKSGDGKMDDDDnTTARSSSRRICGSAASSSDSETADDAPL 
LPDEGPSHAVRLEMPGDKPASPHDRFPKTPLKTLVAFLMLVVAAAGNTITLSWII-IER 
YPLTPPLPDIVFEUPKIPWGLRLCENLM1GSFVSLLVLILFHRHRWIVLRRLCFIGSILYG 
MRCITMMVTPVPKADEDFECSPRFGENATFSLIVMRGVWSMFGLGLNLFDNQKVVL 
CGD YIYSGHTLVLVVSALFIGEYSPRRFYILHWLSWLVCSVGVIFLVI.SHGHYTIDVIL 
SYFACTRVFWAYHTQAAHPS1RLSVQNHQAKEFWFPLLRWFEGD1RRPVPRRFDCPIS 
YS QTPVD YS FLP WFSFFFQSNPMMFPGAFTF YHTQNHSKSTMSLPK 



Sequence listing 15: ceSMS2 

MTNSSEFTDVLQSRDPCVSNGIVINIDPIDPEPTPIRKEFTCEDTFHHEHHGNSEGFKTL 

TAFLCLMLSAFLNFFLLTVIHDVVPRQPLPDLTFMnPQQRWAWSVGDVLSTVSSVVA 

FTI1FLHHQRWIVLRRTFLLGAIMYGLRAVILGVTFLPPSFHNRDEICQPQVNRTAMYG 

MEIATR1-LTYVITLGLTSGQDRILCGDLMFSGHTWLTIMYFVQLQYTPRGLVILRYIA 

APITFLGrAALVVSGGHYTMDVLlAYWLTSHVFWSYHQIFEMRKDDRPQAPLSRLW 

WFWLCYWFESDVADGBCLVNKWNWPLEGPQRMHTIMNRINYKLQ 
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Sequence listing 16: ceSMS3 

MGSVSKTVISARGASPDDEQNGTKNGISNGSEWAKCIFLFFFLFIAGMSNWAVLAYT 

HDYVPRESLPDIVFSLVSEQRWASSLGDFCVALCIVMLGAJLLV1HQHRGTILKRVVFC 

AGTLYAMRSVTLAATQLPSGYTDNQGRCRDQVESEAGVFFGRLFEQTIRIGFQSFCDQ 

MLCGDLLFSGHTLVMVTCSLAVAYYLPKSIKPLQWVSHVACUGMICMTISRTHYTID 

VVIAYWLSNMVFRMYHAYCEVDMCMERRKSILYSWWPCRIVDWLEQDIVPGRLEN 

RCQLPWRRSTPRGQERGGASAESSDSSVTMCDN1TTSHHQKHVSISSSSTYPLPC 



Sequence listing 17: pJSMSl 
MIEGhnNANTTHQNTTNNKIEKNFIKEKDII^ 

DKHYNHKDSYSDNKGKIKTLRSNFSDIQDEVIITNIGHYFEGQEN1KKSVEIKNCPYKK 

KMLKIFLRRLVCATIIGIIGICIQCYFITLSDTYYKTGDEPLKDRLHEIFKEIPAFMNTPFV 

NGSIMFFLAITLLRFGLFCPLLLSITILIRiU.MLSFIYCfRSFFIYVTTLPCPIPTCQPLKHKT 

LVENLYTFYLIITAQVYECTDLVISGHTAFTTLLTFFWFFYERNIYVKTTIFLYSIYIYIim 

SRFHYTVDVLMGYVFGGSVFLFYHYLVDVAARRYALNTSVFPQTYGFSGSFTDRFQ 

VFQYFIRAITYLEALDHRMNFALSYDKEWNCFCSCVPVNKNSLLIKKKNVRNEEYYD 

FSDHFYHSYAGNGTYNLSTIRNIIKEFKRFFGMNKKN 



Sequence listing 18: ;?/SMS2 

MKKYEYFYSPIQLLHLILLLRPILYENYTYBEhnDKSSIGTDMYEINMNRKMSNISISRN 

STINEEETLSEYRLCKILLIKLMFALLFLLIALIIQGFFMIYSDSYYKSNTQPLSDRIHDLF 

GNPPKWISYKLSNTLIAILTLSFLKIILFNSIYLSIAIICRFLYIVGSFYIIRGLLIYVTSLPAT 

LETCLPLESGNFLFNLLQIIKINTNLVYVCADLIVSGHSFSTTIFLMFSFYYINNVnKFIIF 

TFSCFIYAIIUGFmYTSDVLLGIIFGVFMFSFYHlMLDISSQYYIFNKLFEIKIISNNKNIHA 

KPFFLRFFVARIFFKnPYLEGLNYTLDYAlNKNNDLSTFCNCDHDNNKIPLFSFYKPITE 

DKIIINYSDHLYHSYAGDGTINFLFWKFLKTIKKGLHK 
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Sequence listing 19: AMSMSr 

JyLAGPNQLCIRRWTTKHVAVWLKDEGFFEYVDILCNlCHRLDGITLLTLTEYDLRSPPL 

EIKVLGDIKRLMLSVRKLQKIHIDVLEEMGYNSDSPMGSMTPFISALQSTDWLCNGEL 

SHDCDGPITDLNSDQYQYMNGKNKHSVRRLDPEYWKTILSCIYVFIVFGFTSFIMVIV 

HERVPDMQTYPPLPDIFLDSVPRPWAFAMTEVCGMILCYIWLLVLLLHKHRSILLRRL 

CSLMGTVFLLRCFTMFVTSLSVPGQHLQCTGKIYGSVWEKLHRAFAIWSGFGMTLTG 

VHTCGDYMFSGHTVVLTMLNFFVTEYTPRSWNFLHTLSWVLNLFGIFF1LAAHEHYSI 

DVFIAFYITTRLFLYYHTLANTRAYQQSRRARIWFPMFSFFECNVNGTVPNEYCWPFS 

KPAIMKRLIG 



Sequence listing 20: dmSMSr 

MCDGEIGDPVTQPRSEGGGLVTMDQETRTHYLDAATDKHLTNGSPDPEPVDPLLVA 

QWSIENVTSWATCMEHFSRTLLDCLRQEAJDGEVLLSLTEEDVRDMRYKLGYKLTFG 

ELKKFWIAVLKLQLLVKNSSAESVILGIECHGNGNSVYMPLASTGCGPPSSSTCPCPQ 

AECPSYVSDCDTYLRMGGRYVPPEYFKTAMSLGYSFVVTWITSLTMVIVHERVPDM 

KRYPPLPDIFLDNVPHIPWAFNMCEITGSLLFTIWVVVLTFHKYRLVLLRRFFALAGTV 

FLLRCVTMLITSLSVPGTHLQCSQKDFAIDDPhTVDMVGALIIRMSRAYRIWSGLGMSI 

QGVRTCGDYMFSGHTVALTLLNFFITEYTPRNLYFLHTLTWLLNMFGIFFILAAHEHY 

SIDVFVAFYITSRLFLYYHTLANNRALMQSDSKRTRVWFPMFSYFESSVDGMVPNEY 

DTLGSLIDGIIEQIFKAKDQLAISVKRCWLDAPLSGNSSAHMFGGESEQCLRNGTASA 

AFFSPHQSLIGGLGGQRSQTHLNSAQSGSSTPTPATSAPTKSLPTQKKTFRDASVDPFS 

RTTFAAVQQAENVKDSPMKEKKHL 



Sequence listing 21: ceSMSr 

MLDNRPIPADPNEWRCEDVGNWLKKIGMAK1YADLIAMKHKVDGKCLLALTDTDLK 
DPPVSINCLGDIKKILFAIEFLSQKVVEIGNSGVHHRSTPNGMGPSLKNSKDGLLVEYN 
EQNHLSISGEDVYTTTRRAErVEDEETLLDTLAKSSDGTSTVQLISREElIRQVERPDTY 
FKSVAKLLIAFAYSSLSFLMTSFVMVLVHDRVPDTKTYPPLPDIVLDNVPHIPWAFDM 
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CETIGLVLAWWFTVLFFHNQRVIVARRMFSLLGTVFLLRCFTMUTSLSVPG1HLQCE 

ARPNTmQEKLHKAFHIWSNLGMSLHGVRSCGDYMFSGHTTVrTMISHFITEyTPAD 

WTGLHTFTWVLNCFAIFLILAAHEHYSIDVF1AFYISSRMFLYYHAYAYNHAGITATD 

YRMRTWFPLGWFFEYGSQGKVENEFSLPJKIRIPRRVFFABCSEEPKITPKSDSSRKRSS 
VVAAKQNGNSKNHTKKHN 



Sequence listing 22: <?eSMSdr 



MCKSCENYGKFFFAMLKICIDNSSNFAMKKDAENENLLHAYEHDEQSSKYFIKPDSR 

YSIDDDISVKIDGKQLEVRKWPTLLATAMVGVGWLSNEVALAWVHERVPDDYHPLP 

DLFFSHFPEVKJERLKIFGILGKFEKSEKKTSEFSSPIRGAIRIAEYIMMILLISALLVMFTH 

QHRW1VIRRVFFCIAMAYSFRALCVTIFQVPVPS1NTYCAPKSNSSLELVAGRVVKMF 

WSAGIEQLRPRELCGDLIVSGHTLTIFTAFLVFICTYAPQRLQPLSHIYHVLAFTALFSIL 

LARKHYMIDIVLGYTVSTR1FMEYHALAASYHNRTFETNPLAWSFWSFFIPIFECDAPA 

NMHNHLLLYNRSTSSKNVSTLKKSRRSFE 



